Empirical study on clique-degree distribution of networks 
The community structure and motif-modular-network hierarchy are of great importance for
understanding the relationship between structures and functions. In this paper, we investigate the
distribution of clique-degree, which is an extension of degree and can be used to measure the
density of cliques in networks. The empirical studies indicate the extensive existence of power-law
clique-degree distributions in various real networks, and the power-law exponent decreases as the
clique size increases.
The discovery of the small-world effect [1] and the scale-free
property [2] triggered an upsurge in studying the
structures and functions of real-life networks [3, 4, 5, 6, 7].
Previous empirical studies have demonstrated
that most real-life networks are small-world [8], that
is to say, they have a very small average distance like
completely random networks and a large clustering coefficient
like regular networks. Another important characteristic
of real-life networks is the power-law degree distribution,
that is, $p(k) \propto k^{-\gamma}$, where $k$ is the degree and
$p(k)$ is the probability density function for the degree
distribution. Recently, empirical studies have revealed that many
real-life networks, especially biological networks, are
densely made up of some functional motifs [9, 10, 11].
The distribution pattern of these motifs can reflect the
overall structural properties and thus can be used to
classify networks [12]. In addition, the networks' functions
are highly affected by these motifs [13]. A simple
measure can be obtained by comparing the density of motifs
between real networks and completely random ones,



TABLE I: The basic topological properties of the present
seven networks, where N, M, L and C represent the total
number of nodes, the total number of edges, the average
distance, and the clustering coefficient, respectively.

networks/measures             N        M         L         C
Internet at AS level          10515    21455     3.66151   0.446078
Internet at routers level     228263   320149    9.51448   0.060435
the metabolic network         1006     2957      3.21926   0.216414
the world-wide web            325729   1090108   7.17307   0.466293
the collaboration network     6855     11295     4.87556   0.389773
the ppi-yeast networks        4873     17186     4.14233   0.122989
the friendship networks       10692    48682     4.48138   0.178442
however, this method is too rough and thus is still under debate
[14, 15]. In this paper, we investigate the distribution
of clique-degree, which is an extension of degree and
can be used to measure the density of cliques in networks.
FIG. 1: Illustration of the clique-degree of node $i$: $k_i^{(2)} = 7$,
$k_i^{(3)} = 5$, $k_i^{(4)} = 1$, and $k_i^{(5)} = 0$.

The word clique in network science equals the term
complete subgraph in graph theory [16], that is to say,
the $m$ order clique ($m$-clique for short) means a fully
connected network with $m$ nodes and $m(m-1)/2$ edges.
Define the $m$-clique degree of a node $i$ as the number of
different $m$-cliques containing $i$, denoted by $k_i^{(m)}$. Clearly,
a 2-clique is an edge, and $k_i^{(2)}$ equals the degree $k_i$, thus
the concept of clique-degree can be considered as an
extension of degree (see Fig. 1). We have calculated the
clique-degree from order 2 to 5 for some representative
networks. Figs. 2 to 8 show the clique-degree distributions
of 7 representative networks in logarithmic binning
plots [17, 18]; these are the Internet at Autonomous Systems
(AS) level [19], the Internet at routers level [20],
the metabolic network of P. aeruginosa [21], the World-Wide
Web [22], the collaboration network of mathematicians [23],
the protein-protein interaction networks of
yeast [24], and the BBS friendship networks in University
of Science and Technology of China (USTC) [25]. The
slopes shown in those figures are obtained by using
maximum likelihood estimation [26]. Tab. I summarizes
the basic topological properties of those networks.
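
The definition above can be turned into a short counting routine. The following is only a minimal sketch, not the procedure used to produce the figures: the function name `clique_degrees` and the adjacency-set input format are assumptions made for illustration, and nodes are assumed to be mutually comparable (for example, integers) so that each clique is enumerated exactly once.

```python
def clique_degrees(adj, m):
    """Return {node: number of m-cliques containing that node}.

    adj maps each node to the set of its neighbours in an undirected
    simple graph (hypothetical input format chosen for this sketch).
    """
    if m < 2:
        raise ValueError("clique order m must be at least 2")
    counts = {v: 0 for v in adj}

    def extend(clique, candidates):
        # `clique` holds mutually adjacent nodes in increasing order;
        # `candidates` are nodes adjacent to all of them and larger than
        # the last one, so every m-clique is generated exactly once.
        if len(clique) == m:
            for v in clique:
                counts[v] += 1
            return
        for v in sorted(candidates):
            extend(clique + (v,),
                   {u for u in candidates if u in adj[v] and u > v})

    for v in adj:
        extend((v,), {u for u in adj[v] if u > v})
    return counts


# Usage: a complete graph on nodes 0..3 plus a pendant node 4 attached to 0.
adj = {0: {1, 2, 3, 4}, 1: {0, 2, 3}, 2: {0, 1, 3}, 3: {0, 1, 2}, 4: {0}}
print(clique_degrees(adj, 2)[0])  # 4, the ordinary degree of node 0
print(clique_degrees(adj, 3)[0])  # 3 triangles contain node 0
```

For $m = 2$ the routine reduces to the ordinary degree, which is a quick sanity check on any implementation.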

Although the backgrounds of those networks are completely
different, they all display power-law clique-degree
FIG. 2: (color online) Clique-degree distributions of Internet
at AS level from order 2 to 5, where $k^{(m)}$ denotes the
$m$-clique-degree and $N(k^{(m)})$ is the number of nodes with
$m$-clique-degree $k^{(m)}$. In each panel, the marked slope of the red
line is obtained by using maximum likelihood estimation [26].







FIG. 3: (color online) Clique-degree distributions of Internet 
at routers level. 

distributions. We have checked many examples (not
shown here) and observed similar power-law clique-degree
distributions. However, not all networks
display higher-order power-law clique-degree distributions.
Actually, only relatively large networks
could have power-law clique-degree distributions with
order higher than 2. For example, Ref. [21] reports 43
different metabolic networks, but most of them are very
small ($N < 1000$), in which cliques with order higher
than 3 are scarce. Only the five networks with the most
nodes display relatively obvious power-law clique-degree
distributions, and the case of P. aeruginosa is shown in
Fig. 4. Note that, even for small-size networks, high-order
cliques are abundant in some densely connected
networks such as technological collaboration networks [27]
and food webs [28]. However, since the average degree
of the majority of metabolic networks is less than 10, the



FIG. 4: (color online) Clique-degree distributions of the
metabolic network of P. aeruginosa.

FIG. 5: (color online) Clique-degree distributions of the
World-Wide Web.

FIG. 6: (color online) Clique-degree distributions of the
collaboration network of mathematicians.

FIG. 7: (color online) Clique-degree distributions of the
protein-protein interaction networks of yeast.




FIG. 8: (color online) Clique-degree distributions of the BBS
friendship networks in University of Science and Technology
of China. The blue points with error bars denote the case of
the randomized network.



high-order cliques could not be expected with network
size $N < 1000$. Furthermore, all the empirical data show
that the power-law exponent decreases as the clique
order increases. This may be a universal property
and may reveal some unknown underlying mechanism in
network evolution.
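
The slopes quoted in the figures are maximum likelihood estimates. As a rough illustration of how such an exponent can be obtained from a list of clique-degrees, the sketch below uses the standard continuous-approximation estimator $\hat{\delta} = 1 + n / \sum_i \ln(x_i / x_{\min})$. This is an assumption made for illustration: the estimator of Ref. [26] may differ in detail (for instance in its treatment of discrete data), and the function name and the `x_min` parameter are hypothetical.

```python
import math


def powerlaw_mle_exponent(values, x_min=1.0):
    """Continuous-approximation MLE of a power-law exponent.

    Only values >= x_min enter the fit, so nodes with zero
    clique-degree are ignored automatically when x_min >= 1.
    """
    tail = [x for x in values if x >= x_min]
    if not tail:
        raise ValueError("no data points at or above x_min")
    log_sum = sum(math.log(x / x_min) for x in tail)
    if log_sum == 0:
        raise ValueError("all retained values equal x_min")
    return 1.0 + len(tail) / log_sum
```

Applying the same function to the $m$-clique degrees for $m = 2, \dots, 5$ gives one exponent per order, which is the kind of comparison the trend described above relies on.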

In order to show that the power-law clique-degree
distributions with order higher than 2 cannot be
considered a trivial consequence of the scale-free property, we
compare these distributions between the original USTC BBS
friendship network and the corresponding randomized
network. Here the randomizing process is implemented
by using the edge-crossing algorithm [29, 30, 31],
which keeps the degree of each node unchanged.
The procedure is as follows: (1) Randomly pick two
existing edges $e_1 = x_1x_2$ and $e_2 = x_3x_4$ such that
$x_1 \neq x_2 \neq x_3 \neq x_4$ and there is no edge between $x_1$
and $x_4$ as well as $x_2$ and $x_3$. (2) Interchange these two



TABLE II: The empirical ($\delta_m$) and predicted ($\delta'_m$) power-law
exponents of the clique-degree distribution, where $\gamma$ and $\alpha$ denote
the power-law exponents of the degree distribution and the
clustering-degree correlation. The symbol "/" denotes the cases with
$\alpha(m-2) > 2$, leading to negative and meaningless $\delta'_m$.
edges, that is, connect $x_1$ and $x_4$ as well as $x_2$ and $x_3$,
and remove the edges $e_1$ and $e_2$. (3) Repeat (1) and (2)
$10M$ times.
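
The three steps of the edge-crossing procedure can be sketched as follows. This is an illustrative implementation under stated assumptions, not the authors' code: the function name `randomize_edges`, the `swaps_per_edge` and `seed` parameters, and the edge-list input format are all hypothetical, and the loop counts attempted swaps (including rejected ones) so that it always terminates, whereas the text does not say whether rejected picks count toward the $10M$ repetitions.

```python
import random


def randomize_edges(edges, swaps_per_edge=10, seed=None):
    """Degree-preserving randomization of an undirected simple graph.

    edges: iterable of (u, v) pairs. Returns a new edge list in which
    every node keeps its original degree.
    """
    rng = random.Random(seed)
    edge_list = [tuple(e) for e in edges]
    if len(edge_list) < 2:
        return edge_list
    edge_set = {frozenset(e) for e in edge_list}
    for _ in range(swaps_per_edge * len(edge_list)):
        # Step (1): pick two existing edges e1 = x1x2 and e2 = x3x4.
        i, j = rng.sample(range(len(edge_list)), 2)
        x1, x2 = edge_list[i]
        x3, x4 = edge_list[j]
        if rng.random() < 0.5:  # choose the orientation of e2 at random
            x3, x4 = x4, x3
        if len({x1, x2, x3, x4}) < 4:
            continue  # the four end points must be distinct
        if frozenset((x1, x4)) in edge_set or frozenset((x2, x3)) in edge_set:
            continue  # the new edges must not already exist
        # Step (2): remove e1 and e2, connect x1-x4 and x2-x3.
        edge_set -= {frozenset((x1, x2)), frozenset((x3, x4))}
        edge_set |= {frozenset((x1, x4)), frozenset((x2, x3))}
        edge_list[i] = (x1, x4)
        edge_list[j] = (x2, x3)
    return edge_list
```

Because each accepted swap removes one edge and adds one edge at every affected node, the degree sequence is unchanged by construction, while triangles and larger cliques are generally destroyed.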

We call the network after this operation the randomized
network. In Fig. 9, we report the clique-degree
distributions in the randomized network. Obviously, the
2-clique degree distribution (not shown) is the same as
that in Fig. 8. One can find that the randomized
network does not display power-law clique-degree
distributions with higher order; in fact, it has very few 4-cliques
and no 5-cliques. The direct comparison is shown in
Fig. 8.

The discoveries of new topological properties of networks
breathe fresh air into network science.
These empirical studies not only reveal new
statistical features of networks, but also provide useful
criteria for judging the validity of evolution models (for
example, the Barabasi-Albert model [3] does not display
high-order power-law clique-degree distributions). The
clique-degree, which can be considered as an extension
of degree, may be useful in measuring the density of
motifs; such subunits not only play a role in controlling the
dynamic behaviors, but also reflect the basic evolutionary
characteristics. More interestingly, we find that various real-life
FIG. 9: (color online) The clique-degree distributions in the
randomized network corresponding to the BBS friendship
network of USTC. The black squares and red circles represent the
clique-degree distributions of order 3 and 4, respectively. All
the data points and error bars are obtained from 100
independent realizations.

networks display power-law clique-degree distributions whose
exponent decreases with the clique order. This is an
interesting statistical property, and can provide a criterion
for modelling studies.

It is worthwhile to recall a prior work [13] that
reported a similar power-law distribution observed for
some cellular networks. The authors divided all subgraphs
into two types, and claimed that the power law can only
be found in TYPE I. Moreover, they derived the
analytical expression of the power-law exponent $\delta'_m$ for the
$m$-clique degree distribution as
$\delta'_m = 1 + (\gamma - 1)/[m - 1 - \alpha(m-1)(m-2)/2]$,
where $\alpha$ denotes the power-law
exponent of the clustering-degree correlation $C(k) \sim k^{-\alpha}$.
Tab. II displays the predicted power-law exponents $\delta'_m$,
compared with the empirical observations $\delta_m$. For the
TYPE I cases, the predicted results are, to some extent,
in accordance with the empirical data. More significantly,
here we offer clear evidence that those power laws
can also be detected for TYPE II subgraphs, while Ref.
[13] claimed that the power law cannot be observed for
TYPE II cases. Note that, even when the power law is
detected for TYPE II cases, the analytical expression of
$\delta'_m$ loses its validity in those cases. The qualitative
difference in TYPE II cases and the quantitative departure in
TYPE I cases may be attributable to structural bias
(e.g. assortative connecting pattern [32], rich-club
phenomenon [35], etc.), since the derivation in Ref. [13] is
based on uncorrelated networks. In addition, the
prediction accuracy decreases as the clique size
$m$ increases, because the clustering coefficient takes into account
only the triangles [36]. Therefore, a more accurate
analysis may involve higher-order clustering coefficients. In
a word, Ref. [13] provides a starting point for an in-depth
understanding of network structure at the clique level, while
the diversity and complexity of real networks require
further explorations on this issue.
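
To make the analytical expression concrete, the sketch below evaluates $\delta'_m = 1 + (\gamma - 1)/[m - 1 - \alpha(m-1)(m-2)/2]$ for given $\gamma$, $\alpha$ and $m$. The function name `predicted_exponent` is hypothetical, and returning `None` when the denominator is not positive is a choice made here to mirror the "/" entries of Tab. II; the boundary case $\alpha(m-2) = 2$, where the expression is undefined, is treated the same way.

```python
def predicted_exponent(gamma, alpha, m):
    """Predicted m-clique-degree exponent from the expression of Ref. [13].

    Returns None when m - 1 - alpha*(m-1)*(m-2)/2 <= 0, i.e. when
    alpha*(m-2) >= 2 and the prediction is meaningless or undefined.
    """
    denominator = (m - 1) - alpha * (m - 1) * (m - 2) / 2
    if denominator <= 0:
        return None
    return 1 + (gamma - 1) / denominator


# Usage: for m = 2 the expression reduces to the degree exponent itself.
print(predicted_exponent(2.5, 1.0, 2))  # 2.5
print(predicted_exponent(3.0, 1.0, 3))  # 3.0
print(predicted_exponent(3.0, 1.0, 5))  # None, since alpha*(m-2) = 3 > 2
```

The $m = 2$ case is a useful consistency check: the denominator equals 1 regardless of $\alpha$, so the predicted exponent coincides with $\gamma$, as expected because the 2-clique degree is the ordinary degree.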

We thank Dr. Ming Zhao for the useful discussion. 
This work is supported by the National Natural Science 
Foundation of China under Nos. 10472116, 70471033, 
and 10635040. 



[1] D. J. Watts, et al., Nature 393, 440 (1998).

[2] A.-L. Barabasi, et al., Science 286, 509 (1999).

[3] R. Albert, et al., Rev. Mod. Phys. 74, 47 (2002).

[4] S. N. Dorogovtsev, et al., Adv. Phys. 51, 1079 (2002).

[5] M. E. J. Newman, SIAM Review 45, 167 (2003).

[6] S. Boccaletti, et al., Phys. Rep. 424, 175 (2006).

[7] L. da F. Costa, et al., Adv. Phys. 56, 167 (2007).

[8] L. A. N. Amaral, et al., Proc. Natl. Acad. Sci. U.S.A. 97, 11149 (2000).

[9] R. Milo, et al., Science 298, 824 (2002).

[10] A.-L. Barabasi, et al., Nature Reviews Genetics 5, 101 (2004).

[11] S. Itzkovitz, et al., Phys. Rev. E 68, 026127 (2003).

[12] R. Milo, et al., Science 303, 1538 (2004).

[13] A. Vazquez, et al., Proc. Natl. Acad. Sci. U.S.A. 101, 17940 (2004).

[14] Y. Artzy-Randrup, et al., Science 305, 1107c (2004).

[15] R. Milo, et al., Science 305, 1107d (2004).

[16] I. Derenyi, et al., Phys. Rev. Lett. 94, 160202 (2005).

[17] J. Park, et al., Phys. Rev. E 68, 036122 (2003).

[18] M. E. J. Newman, Contemporary Physics 46, 323 (2005).

[19] http://www.cosin.org/extra/data/internet/nlanr.html

[20] http://www.isi.edu/scan/mercator/map.html

[21] H. Jeong, et al., Nature 407, 651 (2000).



[22] R. Albert, et al., Nature 401, 130 (1999).

[23] http://www.oakland.edu/~grossman/

[24] http://dip.doe-mbi.ucla.edu/

[25] This network is constructed based on the BBS of USTC,
wherein each node represents a BBS account and two
nodes are neighboring if one appears in the other one's
friend-list. Only the undirected network is considered.

[26] M. L. Goldstein, et al., Eur. Phys. J. B 41, 255 (2004).

[27] P.-P. Zhang, et al., Physica A 360, 599 (2006).

[28] S. L. Pimm, Food Webs (Univ. Chicago, Chicago, 2002).

[29] S. Maslov, et al., Science 296, 910 (2002).

[30] B. J. Kim, Phys. Rev. E 69, 045101(R) (2004).

[31] M. Zhao, et al., Physica A 371, 773 (2006).

[32] M. E. J. Newman, Phys. Rev. Lett. 89, 208701 (2002).

[33] E. Ravasz, et al., Phys. Rev. E 67, 026112 (2003).

[34] C. Song, et al., Nature 433, 392 (2005).

[35] S. Zhou, et al., IEEE Comm. Lett. 8, 180 (2004).

[36] The theory in Ref. [13] is really accurate for $\delta_3$ if it
belongs to TYPE I; for example, $\delta_3$ in random Apollonian
networks [37] can be exactly predicted by the analytical
result $\delta'_3$.

[37] T. Zhou, et al., Phys. Rev. E 71, 046141 (2005).



